KMID : 0917520070140040131
|
|
Journal of Speech Sciences 2007 Volume.14 No. 4 p.131 ~ p.144
|
|
Improvements on MFCC by Elaboration of the Filter Banks and Windows
|
|
Lee Chang-Young
|
|
Abstract
|
|
|
In an effort to improve the performance of mel frequency cepstral coefficients (MFCC), we investigate the effects of varying the parameters for the filter banks and their associated windows on speech recognition rates. Specifically, the mel and bark scales are combined with various types of filter bank windows. Comparison and evaluation of the suggested methods are performed by two independent ways of speech recognition and the Fisher discriminant objective function. It is shown that the Hanning window based on the bark scale yields 28.1% relative performance improvements over the triangular window with the mel scale in speech recognition error rate. Further work on incorporating PCA and/or LDA would be desirable as a postprocessor to MFCC extraction.
|
|
KEYWORD
|
|
MFCC, Bark Scale, Filter Bank Window, Speech Recognition
|
|
FullTexts / Linksout information
|
|
|
|
Listed journal information
|
|
|